Learning Realistic Patterns from Visually Unrealistic Stimuli: Generalization and Data Anonymization
نویسندگان
چکیده
Good training data is a prerequisite to develop useful Machine Learning applications. However, in many domains existing sets cannot be shared due privacy regulations (e.g., from medical studies). This work investigates simple yet unconventional approach for anonymized synthesis enable third parties benefit such data. We explore the feasibility of learning implicitly visually unrealistic, task-relevant stimuli, which are synthesized by exciting neurons trained deep neural network. As such, neuronal excitation can used generate synthetic stimuli. The stimuli train new classification models. Furthermore, we extend this framework inhibit representations that associated with specific individuals. use sleep monitoring both an open and large closed clinical study, Electroencephalogram stage data, evaluate whether (1) end-users create successfully customized models, (2) identity participants study protected. Extensive comparative empirical investigation shows different algorithms on able generalize same task as original model. Architectural algorithmic similarity between models play important role performance. For similar architectures, performance close using Accuracy difference 0.56%-3.82%, Kappa coefficient 0.02-0.08). Further experiments show provide state-ofthe-art resilience against adversarial association membership inference attacks.
منابع مشابه
Realistic and Unrealistic Expectations
DURING THE LAST FEW YEARS considerable interest has developed in the possibility that some of the myocardial damage consequent to coronary occlusion may be averted.' To explore this possibility there is a critical need for techniques capable of testing the hypothesis that a given intervention is capable of protecting ischemic myocardium. Since it has long been appreciated that myocardial ischem...
متن کاملGeneralization-Based k-Anonymization
Microaggregation is an anonymization technique consisting on partitioning the data into clusters no smaller than k elements and then replacing the whole cluster by its prototypical representant. Most of microaggregation techniques work on numerical attributes. However, many data sets are described by heterogeneous types of data, i.e., numerical and categorical attributes. In this paper we propo...
متن کاملGeneralization-Based k-Anonymization
Microaggregation is an anonymization technique consisting on partitioning the data into clusters no smaller than k elements and then replacing the whole cluster by its prototypical representant. Most of microaggregation techniques work on numerical attributes. However, many data sets are described by heterogeneous types of data, i.e., numerical and categorical attributes. In this paper we propo...
متن کاملGeneralization-Based k-Anonymization
Microaggregation is an anonymization technique consisting on partitioning the data into clusters no smaller than k elements and then replacing the whole cluster by its prototypical representant. Most of microaggregation techniques work on numerical attributes. However, many data sets are described by heterogeneous types of data, i.e., numerical and categorical attributes. In this paper we propo...
متن کاملAnonymization of Set-Valued Data via Top-Down, Local Generalization
Set-valued data, in which a set of values are associated with an individual, is common in databases ranging from market basket data, to medical databases of patients’ symptoms and behaviors, to query engine search logs. Anonymizing this data is important if we are to reconcile the conflicting demands arising from the desire to release the data for study and the desire to protect the privacy of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Artificial Intelligence Research
سال: 2021
ISSN: ['1076-9757', '1943-5037']
DOI: https://doi.org/10.1613/jair.1.13252